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1. Introduction 

An Egyptian fraction is a sum of reciprocals of distinct positive integers, so called because the 
ancient Egyptians represented rational numbers in that way. In an earlier paper, the author 
|| showed that every positive rational number r has Egyptian fraction representations where 
the number of terms is of the same order of magnitude as the largest denominator. More 
precisely, there exists a positive constant C(r) such that for every x that is sufficiently large 
in terms of r, there is a set £ of positive integers not exceeding x with J2neE 1/n = r and 
|£| > C(r)x, where |£| denotes the cardinality of £. 

Independently, Croot || considered the problem of determining which integers can be 
represented as Egyptian fractions with denominators not exceeding some bound x. He 
showed that any integer less than 

1 j f 9 , m \ Oglogx) 2 , , 

logx + 7- [- + o 1 J — — 1 

v 2 ' log x 

(where 7 is Euler's constant) can be so represented, an impressive feat considering that the 
sum of the reciprocals of all the integers up to x is only log £+7. The methods used in and 
H are to an extent quite coincident; one feature that they share is that the representations 
constructed in both cases use only integers satisfying some upper bound on the size of their 
prime factors (a restriction that at some point is inherent in the problem rather than just a 
flaw in the method). However, Croot's technical work in this regard is much sharper than 
that in || , as is attested to by the fact that the expression (]!]) is best possible (apart perhaps 
from the constant 9/2). 

The purpose of this paper is to combine Croot's techniques with the methods in the au- 
thor's paper || to sharpen the theorem established in that paper, and to apply these methods 
to two related questions concerning Egyptian fractions. We can improve the aforementioned 
result from 01 as follows: 



Theorem 1. Let r be a positive rational number. For every x that is sufficiently large in 
terms of r, there is a set £ of integers not exceeding x, such that J2ne£ ^/n = r and 

'X log log 2> 

logx 

Furthermore, this is best possible: the main term cannot be increased, nor can the error term 
be reduced. 



|£| >(1 _ Ol _ 0r (£»i). (2) 



Theorem |l] asserts the existence of Egyptian fraction representations with many terms 
relative to a predetermined bound on the maximal denominator; it is a slightly more delicate 
matter to turn the problem around, and seek Egyptian fraction representations with small 
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denominators relative to a predetermined number of terms. To be more concrete, for each 
positive integer t let us define 

' 1 

■K t (r) = {(zi, . . . , x t ) G Z* : x x > ■ ■ ■ > x t > 1, ^ — = r}, 

i=i Xi 

the collection of all sets of denominators in t-term Egyptian fraction representations of r, 
and 

M t {r) = inf{ Xl : (x 1 ,...,x t ) e K t (r)}, (3) 

the smallest integer x that is the largest denominator in a t-term Egyptian fraction represen- 
tation of r (unless no such t-term representation exists in which case M t (r) equals infinity). 
Among the open problems posed by Erdos and Graham [Q and also found in Guy H Sec- 
tion Dll] is to determine the asymptotic size of M t (l), and the question generalizes directly 
to M t (r). 

The set 3i 4 (r) might well be empty for small values of t. If we define to( r ) to be the least 
number of terms in any Egyptian fraction representation of r, then !K t (r) is nonempty for 
every t > t Q (r), since a representation with t terms can be converted into one with t + 1 
terms by "splitting" the term with largest denominator, using the identity 



n n + 1 n(n + 1) 

Therefore M t (r) = oo for t < to(r) and M t (r) < oo for t > to(r). (The only flaw in this 
argument arises when the largest denominator is n = 1, in which case the splitting identity 
does not yield distinct terms; and in fact, the rational number 1 has an Egyptian fraction 
representation with one term but no representation with two terms. For this reason, we 
make the convention that to(l) = 3.) 

The author's result from || implies that for each positive rational r, there are infinitely 
many values of t for which M t (r) <^C r t, and Theorem |T] improves the constant in this 
upper bound to the best-possible value of (1 — e _r ) _1 + o(l). With a little more work, 
however, we need not be content with establishing such a bound merely for infinitely many t. 
The following theorem completely resolves Erdos and Graham's question in terms of the 
asymptotic behavior of M t (r): 

Theorem 2. For all positive rational numbers r and all integers t > t (r), we have 
which is best possible. 

Here it is appropriate to note that recently, Croot 0] established the related result that 
any positive rational r has an Egyptian fraction representation whose denominators all lie 
in the interval [x, e r x + O r (xloglogx/logx)], if x is large enough in terms of r. This result 
implies Theorem [I], and in fact somewhat more: if we define 

M[{r) = mm{x t - x x : ( Xl ,. . .,x t ) G ftt(r)}, 

then just as Theorem p] implies that M t (r) = t/(l — e~ r ) + O r (t loglogt/ logt) for infinitely 
many t, so does Croot's result imply the stronger result that M' t (r) = t + O r (t loglogt/ logt) 
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for infinitely many t. This asymptotic expression for M' t (r) is also best possible; however, 
no result analogous to Theorem ^, valid for all t, is obtained for M' t {r). 

The methods used to establish Theorems [1| and ^ allow us also to address another problem 
posed in Ml. Erdos and Graham observe that a prime power can never be the largest 
denominator in an Egyptian fraction representation of 1, nor can a tiny multiple of a prime 
power; they ask whether the set of integers with this property has positive density or even 
density 1. The analogous question can be asked about those integers that cannot be the 
largest or second-largest denominator in an Egyptian fraction representation of 1, and so on 
(as is mentioned in [H]). 

We can generalize this problem to Egyptian fraction representations of any positive rational 
number r. For any positive integer j, let us define 

£>j(r) = {x G Z, x > r~ l : there do not exist x%, . . . , x t G Z, x\ > ■ ■ ■ > x t > 1 

with Y?i=i 1/xi = r and Xj = x}, 

the set of numbers that cannot be the jth-largest denominator in an Egyptian fraction 
representation of r. We exclude the integers x < r -1 from consideration because they can 
never be a denominator in an Egyptian fraction representation of r (except for the trivial 
representation when r is itself the reciprocal of an integer). The questions of Erdos and 
Graham then become whether £i(l) has positive density, what can be said about £i(l) H 
£2(1), and so on. 

In our primary theorem concerning these questions, we discover some information about 
the sets &j(r) for j > 2 that is perhaps quite surprising: 

Theorem 3. Let r be a positive rational number. The set £>j(r) is finite for any integer 
j > 2, and there exists an integer jo{r) such that £>j(r) is empty for all j > jo{r). 

One consequence of Theorem |3| is that only finitely many integers cannot be the second- 
largest denominator in an Egyptian fraction representation of 1; possibly {2, 4} is a complete 
list of integers (greater than 1) with this property. Another consequence is that every integer 
greater than 1 can be the jth-largest denominator in an Egyptian fraction representation of 
1, when j is sufficiently large; possibly this holds for every j > 3. It might be interesting 
to establish a bound for the largest such integer and subsequently determine the sets £2(1) 
and £3(1) precisely with the aid of a computer; however, we do not undertake these tasks 
herein. 

Because the prime factors of the denominator of r are the only primes that can possibly be 
the largest denominator in an Egyptian fraction representation of r, the set &i(r) is certainly 
infinite. However, we are able to answer Erdos and Graham's question of whether £i(r) has 
positive density in the negative; in fact, we can even establish the order of growth of -Ci(r). 
Let us define the counting function Li(r;x) of £i(r), 

Li(r;x) = #{1 < n < x: n G &i(r)}. 

Then we have the following theorem: 

Theorem 4. Let r be a positive rational number. The set £i(r) has zero density, and in 
fact, if x > 3 is a real number then 

x log log x , . , x log log x , . 

r <r Li(r; x) < r ° S . 6 
log x log X 
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The lower bound in the inequality (|6|) is a simple quantitative consequence of the observa- 
tion that tiny multiples of prime powers are elements of £<i(V), as we shall show in the next 
section, while the upper bound reflects the discovery that all elements of £->i{r) are of this 
form (the only ambiguity being the exact meaning of "tiny"). This discovery was probably 
known to Croot (certainly in the case where r is an integer), since the methods we will use 
to establish the inequality ([]) are to a large extent present in 0. 

The author would like to thank Ernest S. Croot III for enlightening conversations and 
for providing access to his manuscripts prior to publication, as well as John Friedlander for 
suggestions that greatly improved the presentation of this paper. The author also acknowl- 
edges the support of National Science Foundation grant number DMS 9304580 and Natural 
Sciences and Engineering Research Council grant number A5123. 

2. Reduction of Theorems [I] and || 

We begin by defining some notation that will be used throughout the paper. Hereafter p 
will always denote a prime and q will always denote a (not necessarily proper) prime power. 
As is standard, the function n(x) denotes the number of primes not exceeding x, and P(n) 
denotes the largest prime divisor of n. It is more convenient for our purposes, however, to 
consider the sequence of prime powers as more fundamental than the sequence of primes. 
Therefore, we will use 7r*(x) to denote the number of prime powers not exceeding x, so that 
7i*(x) = tt(x) + 7r(a; 1 / 2 ) + n(x 1 ^ 3 ) + ■ • • . We will also let P*{n) denote the largest prime power 
that divides n; for example, -P(12) = 3 but P*(12) = 4. (By convention we set P*(l) = 1.) 
Notice that if n is the least common multiple of I and m, then P*(n) = max{P*(/), P*(m)}; in 
particular, if a and b are coprime and a/b = a 1 /bi + a 2 /b 2 , then P*{b) < max{P*(&!), P*{b 2 )}. 
The statements of Theorems [I] and [| can be reduced to the following two propositions: 

Proposition 5. Let I be a closed subinterval of (0, oo). There exists a positive real number 
T(I) such that, for all integers t > T(I) and all rational numbers r = a/b G / such that 
P*{b) < t\og~ 22 t, there is a set &oft distinct positive integers such that J2n<=£ V n = r an< ^ 

max{ne£}<- — + OA— ■ — —J. 7 

1 — e r v log t ' 



Proposition 6. Let r be a positive rational number. There exists a positive constant 5(r) 
such that, for every real number x that is sufficiently large in terms of r , all sets £ of positive 
integers not exceeding x for which Y,ne£ 1/n = r satisfy 

x log log X 



|£| < (1 - e~ r )x - 5{r)- 



\ogx 



Let us first verify that Theorem [TJ follows from these two propositions. Let r = a/b be 
a positive rational number and x a real number that is sufficiently large in terms of r. Let 
C(r) be a large positive constant depending on r, and define / to be the length-zero interval 
/ = {r} and 

C(r)x log log 



t 



logx 
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The integer t will satisfy both t > T(I) and tlog -22 1 > P*(b) as soon as x is large enough 
in terms of r (since T(I) depends only on r), and so we can apply Proposition |5| to obtain a 
set £ of t distinct positive integers such that J2 n e£ l/n = r and 

max\n E £} < h O r ( — °^ °^ \ 

1 J l-e~ r V log* ' 

But if C(r) is chosen large enough, the right-hand side of this inequality will be bounded 
above by x by the definition (§) of t, and so the integers in £ do not exceed x. Also, since 
|£| = t, the definition of t certainly implies the lower bound (g), and therefore the first 
assertion of Theorem p] has been shown to follow from Proposition |5|. On the other hand, 
the remaining assertion that the lower bound (fj) is best possible follows immediately from 
Proposition ||. 

In the same manner we can deduce Theorem ||] from Propositions [5] and [f| Given a positive 
rational number r = a/b and an integer t > T({r}) that is so large that tlog~ 22 t > P*(b), we 
can apply Proposition | to obtain a set £ of t distinct positive integers such that J2n££ V n = 
r, whose largest term is at most t/(l — e~ r ) + O r (t log log log t). By the definition @ of 
M t (r), this establishes the upper bound implicit in the asymptotic formula fl5|) when t is 
sufficiently large in terms of r; but by adjusting the constant implicit in the O-notation if 
necessary, we see that this upper bound is valid for all t > to(r). On the other hand, if £ 
is any set of t distinct positive integers satisfying J2ne£ l/n = r whose largest element is x%, 
then Proposition || shows that 

t < (1 — e )Xi — o(r)- 



which implies that 



logxi 

when t is large enough in terms of r. Since M t (r) equals the smallest such x±, we see that 
the right-hand side of the inequality @ is also a lower bound for M t (r). This argument 
shows that Theorem |2| in its entirety follows from the two propositions. 

The reader will have noticed that although Proposition || is stated uniformly for certain 
rational numbers in the interval /, no use was made of this in deducing Theorems [l] and |[ 
However, we will need the uniformity present in Proposition || in the proofs of Theorems ||] 
and it is for this reason that we take the time to establish the proposition in its current 
form. 

It turns out that the construction used to establish Proposition || proceeds in two stages 
which, although similar in spirit, require quite different subsidiary lemmas to complete. For 
this reason, we reduce Proposition [| to the following two propositions: 

Proposition 7. Let I be a closed subinterval of (0,oo). For any real number x that are 
sufficiently large in terms of I, any rational number r G / whose denominator is not divisible 
by any prime power exceeding x log -22 x, and any integer R satisfying 

r\ _-t^ 3r^xloglogX n ^ T X log log X 



e x 



(22(l-e-n--) X T — ' ~R <~ 7 g , (10) 
- p r ' log x e r log x 



there exists a set 51 of integers satisfying: 
(i) 11 is contained in [x/2e r ,x]; 
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(ii) = R; 

(iii) if — = r — — in lowest terms, then < — < 1 and P*(b) < x 1//5 . 

v ; b hgx b K ' ~ 



(As one might think, the constants in the inequality (|10l) , other than the initial (1 — e r ), 
are somewhat arbitrary and chosen simply for convenience during the proof.) 

Proposition 8. Let y be a sufficiently large real number, and let a/b be a rational number 
satisfying 1/ logy < a/b < 1 and P*(b) < y. Then there is a set S of integers satisfying: 

(i) § is contained in [l,2y 4 ]; 

(ii) |S| = 27r*(y); 



in 



U/ x ^ 1 

b~ ~ n' 

n6§ 



To see how Propositions [7| and |8] together imply Proposition |, we fix a closed interval 
/ C (0, oo), an integer t > T(I) where T(I) is a positive constant that is sufficiently large 
in terms of /, and a rational number rGl whose denominator is not divisible by any prime 
power exceeding tlog~ 22 t. We define 

t / 22 -(22 + 3r)e-^tloglogt 

X (l_ e -r)2 J i ogt ^ 

and i? = t — 27T*(x 1 / 5 ). When these values of x and i? are substituted into the inequality fllOD, 
the left-hand side has order of magnitude t(log logt) 2 / log 2 1 after simplification, while the 
right-hand side has order of magnitude t log log tj log t; therefore the inequality ( |i0|) holds as 
long as T(I) is large enough. Certainly x log -22 x > t log -22 1 as well if T(I) is large enough. 
We may therefore apply Proposition [7] to obtain a set 01 of integers and a rational number 
a/b satisfying properties (i)-(iii) of Proposition [7|. With this rational a/b, we may then apply 
Proposition || with y = x 1 ^ as long as T(I) is large enough (since x and y are functions of 
t), obtaining a set § satisfying properties (i)-(iii) of that proposition. 

We now set £ = ttUS. Because 2y 4 = 2x 4 / 5 < x/2e r if T(I) is large enough, it follows from 
the two properties (i) that 31 and S are disjoint and that the integers in £ do not exceed x. 
Moreover, the two properties (ii) imply that |£| = t, while the two properties (iii) imply that 
r = I] n e£ l/ n - By the definition (JTTJ) of x, we see that this set £ satisfies all the properties 
required for the conclusion of Proposition [5], and so that proposition does indeed follow from 
Propositions and || 

In a sense, we have separated the desire to have an Egyptian fraction with many terms 
from the desire to have a specific number of unit fractions that add to r. Proposition [7| yields 
an Egyptian fraction with many terms, but one whose sum is only an approximation to r; 
while Proposition § yields an exact Egyptian fraction representation of a/b with a specified 
number of terms, but with that number of terms rather small compared to the size of the 
largest denominator. 

The broad idea of the proofs of these two propositions uses the general strategy employed 
by the author in || and by Croot in : a collection of unit fractions whose sum is relatively 
close to the target rational number is constructed, and then for each prime power that 
appears in the denominator of this sum but not in the denominator of the target rational, 
a few terms are omitted or added so that the modified sum is no longer divisible by that 
prime power. When all of the unwanted prime powers have been eradicated in this way, 
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estimates on the number and sizes of the omitted or added terms are used to show that the 
resulting sum must exactly equal the target rational. Proposition is used to evict the larger 
unwanted prime powers, while Proposition [8] is used to evict the smaller prime powers. 

To summarize the achievements of this section, we have reduced Theorems [I] and ^| to 
establishing Propositions || 0, and These three propositions will be the subjects of Sec- 
tions [| |4], and [5], respectively. To complete the outline of the rest of this paper, we mention 
that Theorem |^ will be established in Section |6] and that Theorem [| will be established in 
Section [7|. 



3. The Very Large Prime Powers 

In this section we establish Proposition |6|, which was used to show that Theorems [3] and |2] 
are best possible. The strategy is to quantify the observation that a tiny multiple of a very 
large prime (or prime power) cannot appear in an Egyptian fraction representation of a given 
rational number, and then to calculate the effect that this restriction has on the possible 
number of terms in such a representation. 

Lemma 9. Let x be a positive real number. Suppose that x\,...,Xt are distinct positive 
integers not exceeding x and that p is a prime dividing at least one of the x,i. If p does not 
divide the denominator ofJ2t=i ^-/ x % expressed in lowest terms, then p< a;/ logx. 

We remark that a slightly modified version of this lemma could be established for prime 
powers q rather than merely for primes p, but this formulation suffices for our purposes. 

Proof: Because the integers Xi that are not divisible by p do not affect whether the denomi- 
nator of Yh=i l/ x i is divisible by p, we may assume that p divides all of the Xj. Set Wi = Xi/p 
for each 1 < i < t, and set A = \cm{wi, . . . , w t }. Then 



i=i x 



where each summand is an integer by the definition of A. We are assuming that the de- 
nominator of J2t=i l/ x i is n °t divisible by p when reduced to lowest terms. Consequently, 
if 

iv = E- = AE- (13) 

1=1 x l 1=1 w * 

is the numerator of the fraction on the right-hand side of equation (|T^) , then N must be a 
positive multiple of p; in particular, N > p and thus logiV > log p. 

On the other hand, the collection {wi} is a subset of the integers not exceeding x/p. If we 
define L(x) = lcm{l, 2, . . . , [x\ }, so that 



L(x) ] [ /> exp E l°gP I — e2x 

p v <x \p v <x I 



(14) 



by the prime number estimate of Chebyshev, then A < L(x/p) and so equation ( |i~3"D implies 

E -<i(-)(log- + l). 



1 w<x/p 1 ' 
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But then 



logp < logiV < logL(-) + 0( log log-) < - 

\p/ \ p> p 

by the estimate (fL4|), and so p C x/ logx as claimed. □ 

Next we establish an elementary lemma that provides asymptotic formula? for the number 
of integers free of very large prime (or prime power) factors and for the sum of the reciprocals 
of such integers. 

Lemma 10. Uniformly for sfx < y < x and < a < 1, we have 

P(n)>y 

and 

E I = .oga-Mog^ + 0(-^). 
~rv ti logw v alogx y 

P(n)>y 

Both formulas remain valid if P(n) is replaced by P*(n) in the conditions of summation. 
Proof: It is a direct consequence of Mertens' formula for J2 P <x that 

E - = ^T±7 + °(—J' ( l5 ) 

y<p<x 



p logy Mogy 



and the same asymptotic formula is true if the summation is taken over prime powers q 
rather than merely primes p. We note that any integer n < x such that P(n) > y can be 
written as n = mp where y < p < x, and this representation is unique since y > \fx. The 
first assertion of the lemma then follows from equation flT5| ) by writing 

E i= E E != E + 

ax<n<x y<p<x ax/p<m<x/p y<P<% " 

P{n)>y 

and invoking Chebyshev's estimate tc(x) <C x/logx to bound the error term. The second 
assertion follows in a similar manner from writing 

E -= E - £ -= E i(io ga -'+o(A)). 

ax<n<x y<P<% " ax/p<m<x/p V<P<% " 

P(n)>y 

Because the asymptotic formula fll5|) is insensitive to the inclusion of the proper prime 
powers, these arguments are equally valid when P(n) is replaced by P*(n) in the conditions 
of summation. □ 

Proof of Proposition ^ Suppose that r is a positive rational number, x is a real number 
that is sufficiently large in terms of r, and £ is a set of positive integers not exceeding x such 
that J2ne£ l/n = r. Let C > l be a large constant and define 

A = {n < x : P(n) < Cx/ \ogx). 
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We can assume that x is so large that all of the prime divisors of the denominator of r are 
less than Cx/logx. Then if C is chosen large enough, the set £ must be contained in A by 
Lemma [5[ 

Choose < a < 1 such that, if we set £' = [ax, a;] DA, then |£'| = |£|; in other words, £' 
is the subset of A with cardinality |£| whose elements are as large as possible. Then 

|£| = |£'l= Ei- E i 

ax<n<x ax<n<x 

P(n)>Cx/ logx 

= (i _ a)x + 0(1) - (1 - a )x log ( * og * -) + OU?-) 

Mog(Gx/ logx) 7 v logx 7 



by Lemma [10| with y = Cx/ logx. Since 

/ logx \ / loglogx logC\-i loglogx / 1 \ 

lo S(i — 77T71 r)=logU : + -j = — j + 0{- ). 

Mog(Cx/ logx) ' v logx logx 7 logx Mogx 7 

we see that 

| 8 | = (1 _ o)l ( 1 _!^5i£) + ( * ). (17) 



(16) 



On the other hand, the elements of £' are by definition at least as big as the elements of 
£, and so 

111 1 

^ = E->E-=E-- E - 

nGc n£c' ax<n<x ax<n<x 

P(n)>Cx/ logx 

- >o ga -> + 0(JL) - lo ga - log + o(J^ x 

again by Lemma [T0| with y = Cxj logx. Using equation (|l^) again, we see that 

^ , -if, loglogxx / 1 

r>loga 1 }+0 — 

v log x ' K a log x 

which implies that 

a > e - r (i - rl °g lQ g x + q ( 1 \) 
~ ^ logx Mogx 77 

With this lower bound, equation ([17|) becomes 



|£| < (! _ W, _ + O r ( J-))Wl - + (^-) 

v v logx v iogx y// v logx ' Mogx 7 

-r\ -vt-t \\ X loglogX / X \ 

= 1-e r )x- (1-e r l+r , 6 + CU; )■ 

log x v log X 7 

Since e r > 1 + r for r > 0, we may choose 5(r) satisfying < 5(r) < 1 — e~ r (l + r), whence 

, oi / i^ - n x / ^ log log g nfi x 

c. < (1 — e )x — d(r) — : (18) 

logx 

when x is large enough in terms of r. □ 
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4. The Large Prime Powers 

In this section we establish Proposition [7[ As mentioned in the introduction, the methods 
in this section are in large part derived from those in Croot H, albeit with some modifi- 



cations necessary for the problem at hand. In particular, our Lemma [12] below is a direct 
generalization of Proposition 2]. 

Let 1 1 denote the distance from x to the nearest integer. If a and n are coprime, then let 
a (mod n) denote the integer b with < b < n and ab = 1 (mod n). (Often we write simply 
a when the modulus is clear from the context, e.g., when the term a appears in the numerator 
of a fraction whose denominator is the modulus.) The following lemma demonstrates that 
under suitable conditions on a set of integers M, the elements of M cannot all be small 
compared to a modulus n and yet predominantly have inverses (mod n) that are close to 
(mod n), even when scaled by a nonzero residue h. 

Lemma 11. Let n be a sufficiently large integer, let k be a positive integer, and let B and C 
be positive real numbers with C satisfying 200(logn/ loglogn) fe < C < n. Suppose that M is 
a set of positive integers with cardinality greater than C , such that each element m of M is 
less than B and is the product of k distinct primes not dividing n. Then for any < h < n, 
at least C/2 elements m of M satisfy 

hfh C(loglogn) fc 



n 



200Slog fe n ' 



where m denotes the inverse of m (mod n) 



Proof: For m G M, define r m to be the integer satisfying — n/2 < r m < n/2 and r m 
li in (mod n); since n does not divide h and (m,n) = 1, we see than r m is nonzero. Also 
define 

mr m — h 
n 

so that s m is an integer satisfying \s m \ < m/2 + 1. Suppose that at least C/2 of the s m 
satisfied \s m \ < C(loglogn) fc /100 log fc n. Then, by the pigeonhole principle, there would be 
a particular value s with \s\ < C(loglogn) fc /1001og fc n such that s m = s for at least 

C/2 i logn 

> 20(- 



2C*(loglogn) fc /1001og fe n + 1 M og l ogra , 
of the elements m of M, by the lower bound on C . For each such m, we see that 

ns m + h ns + h 

'm j 

m m 

and since the r m are nonzero integers, we see that the nonzero integer ns + h is divisible by 
at least 20(logn/ log \ogn) k elements of M. 

On the other hand, it is well-known that the maximal order of the number of distinct 
prime divisors of an integer m is asymptotic to logm/ log log m, as achieved by those m that 
are the product of all the primes up to about logm; thus when m is sufficiently large, every 
integer up to m has less than 2 logm/ log logm distinct prime divisors. Since \ns + h\ < 
n(C(log logri) fc /100 log fc n + 1) < n 2 by the upper bound on C, we see that ns + h has at 
most 21ogn 2 / log log n 2 < 41ogn/ log log n distinct prime factors when n is sufficiently large, 
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and so the number of divisors of ns + h that are the product of k distinct primes does not 
exceed 



1 / 4 logn 
k\ Mog logn 



< 



20 ( 



logn 
log log n 



contradicting the lower bound for the number of elements of M that divide ns + h. 

This contradiction shows that at most C/2 of the s m satisfy \s m \ < C(loglogn) fc /100 log fc n, 
and so at least C/2 of the s m satisfy the reverse inequality. For these elements m, we see 
that 



ns m + h\ ^ n /C(loglogn) fc 
nA 1001og fc n 



> 



rn 



-1 > 



Cn(log logn)' 
2005 log fc n 



by the upper bounds for h and m and the lower bound for C. But then by the definition 
of r m . 



hm 






n 




n 



> 



C(log logn) fe 
2005 log fc n 



which establishes the lemma. 



□ 



The next lemma translates the statement of Lemma [TT| to an assertion that under suitable 
conditions on a set M, the inverses (mod n) of the elements of M must have a subset whose 
sum is congruent to any predetermined residue class (mod n). 

Lemma 12. Let n be a sufficiently large integer, let k be a positive integer, and let B be a 
real number satisfying 



B > 



(logn 



,(*-l)/2 



(log logn) fe / 2 ' 

Suppose that M is a set of integers whose cardinality C satisfies 

2005 2 / 3 (logn)( 2fc+1 )/ 3 



(19) 



C > 



(log logn) 2fe / 3 



(20) 



such that each element m of M is less than B and is the product of k distinct primes not 
dividing n. Then for any residue class a (mod n), there is a subset % of M such that 



y~] m = a (mod n) 

meOC 



(21) 



Proof: We begin by remarking, in preparation for applying Lemma |TT], that the hypotheses 
on B and C ensure that C > 200(logn/ log logn) k . Also, if C > n, then the conclusion of 
the lemma holds under the weaker assumption that each element of M is coprime to n, by 
the Cauchy-Davenport-Chowla Theorem (see for instance Vaughan |9], Lemma 2.14], and 
also H Lemma 2]). Therefore we can assume that C < n. 
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Let e n (x) denote the complex exponential e 2nlx ^ n of period n. If we let N be the number 
of subsets of M satisfying the condition (21), then by the finite Fourier transform 

N= -I> n (V E "i-tt)) =-Y,e n {-ha) J] (l + e n (hm)) 



OCcM 71 h=0 \ \ meOC / / 71 h=0 mGJVt 



2 C 1 

\- ~ V e n (-ha)P h , 

h=l 



(22) 



where we have defined P/j = IlmeJvtll + e n{hm)). Using the identity |1 + e lt \ 2 = 2 + 2 cost 
and the inequality 1 + cos27rf <2 — 8||t|| 2 , we see that 

, „ l9 -i r 27rhrh\ n -n / . hm 2\ 

w= n ( 2 + 2cos -^)< 4 n f 1 - 4 !^!)- (23) 



Tt ' 

meM m€M 



All of the terms in this product are nonnegative and bounded above by 1; and when n is 
sufficiently large, by Lemma |ll| at least C/2 of them are bounded above by 

'C(loglogn) fc \2 



i-4(: 



2005 log fc n 



when 1 < h < n — 1. Using this fact in the inequality fl23|) along with the bound 1 — t < e~ 



we obtain 



m ~ " iooooP 2 io g 2fc J ~ p v 2 ooooP 2 i og 2fc J p( ogn) 

by the lower bound ( p0|) on C, and thus jP^I < 2 c /ri when 1 < h < n — 1. 
From this upper bound, we deduce from equation (B2|) that 



}C i n—l i \nC 



AT 

n 



1 (77 - 1 )2 C 

<^£|P,|< )_ 



which implies that iV > 2 C /n 2 . In particular, there do exist subsets 3C of M of the desired 
type. □ 

We remark that a stronger inequality for B than (|19D might be needed to ensure the 
existence of a set M with the properties described in the statement of Lemma |12| (for instance, 
in the case k = 1). 

The following lemma is the main tool that will be used in our recursive construction in 
the proof of Proposition |7|. 

Lemma 13. Let < £ < 1 be a real number, and let x be a real number that is sufficiently 
large in terms of £. Let c/d be a rational number and define q = P*(d), and suppose that 
X V5 < q < xlog~ 22 x. Then there exists a set U of integers satisfying: 

(i) U is contained in [£x,x]; 

(ii) |U| < 200(x/g) 2 / 3 log 3 a;; 

(iii) for each element n of XL, P*(n) = q; 

d c 1 

(iv) if — = — — — in lowest terms, then P*(d') < q. 
y 1 J d' d ^.n ' y ' H 
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Proof: We apply Lemma [12] with n = q, k = 4, B = x/q, and a the residue class of 
c(d/q) (mod q). Let 7 be the set of all primes in the interval ((^x/q) 1 / 4 , (x/q) 1 / 4 ) that do 
not divide q, and let M be the set of all integers of the form P1P2P3P4, where the Pi are 
distinct elements of 7. Certainly each element of Mo is between and B. Since the 
cardinality of 7 is 3> (1 — £) (x/q) 1 / 4 log -1 x by the prime number theorem, the cardinality 
of Mo is ^ x/(glog 4 x), and so we can choose a subset M of cardinality 

C= L200(x/g) 2/3 log 3 a;J 

when x is sufficiently large in terms of £, by the upper bound on q. 

Since these values of B and C do satisfy the hypotheses ([19]) and ( p0|) of Lemma [12|, we 



can find a subset X of M such that J2m£X m = a ( m od q). Now define 

XL = {qm : m G 

We see immediately that properties (i) and (ii) hold for li. Since q > x 1//5 , all the elements 
of 7 (hence all prime-power divisors of elements of X) are less than q, and so property (iii) 
holds for 11 as well. As for property (iv), it is clear that P*(d') < q, since all of the prime 
powers dividing d or any of the elements of U are at most q. On the other hand, q does not 
divide d', since if q = p v then 

c'q _ cq ^ q c ^ 1 
d' d ~^,n d/q Ti*m 



= c(d/q) — m = (mod p), 

neM 

and so d' is divisible by at most p u ~ l after reducing to lowest terms. Thus P*(d') < q, which 
establishes the lemma. □ 

As a last step in the construction used to establish Proposition ^, we will be appending a 
collection of unit fractions none of whose denominators contain large prime-power divisors. 
The following lemma ensures that there are enough such integers in a suitable range to 
accommodate this. 

Lemma 14. Let < rj < 1 be a real number. There exists a constant 5 = S(rj) such that, for 
all real numbers x that are sufficiently large in terms of 77 and for all pairs of real numbers 
77 < a, e < 1, there are at least 5x integers n in the interval [ax/2, ax] satisfying P*{n) < x £ . 



Proof: We recall that a y-smooth integer is one all of whose prime factors are at most y, 
so that in particular, an integer n is x e -smooth precisely when it satisfies P(n) < x e . It is 
well-known (see for instance Hildebrand and Tenenbaum 0) that the number of y-smooth 
integers n < x is 

*pt$* 1 )+°'(r-) < 24 > 

v log y ' v log x ' 

uniformly for x v < y < x; here p(u) is the Dickman function, which is positive, continuously 
differentiable on [l,oo), and satisfies |p'( M )l — 1 011 ^ na ^ interval. In particular, 

/log ax \ / , loga \ ^ ( 1 \ 

v log x e ' v e log x ' v log x ' 
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uniformly for rj < a, s < 1. However, n being x £ -smooth is a slightly weaker condition than 
P*(n) < x e . The asymptotics for the counting function of x e -smooth integers could be shown 
to hold for the number of integers n < x satisfying P*{n) < x e as well, but since we only 
need a weak lower bound for the number of such integers we argue as follows. 

By equation (P^), using equation and the analogous statement with a replaced by 
a/2, we see that the number of x e -smooth integers between ax/2 and ax is ap(e~ 1 )x/2 + 
O v (x/ logx). On the other hand, the integers n that are x e -smooth but for which P*(n) > x e 
are all divisible by at least one prime power p v > x £ with p < x 6 . Given an integer k > 3, any 
integer n that is x £ -smooth but for which P*(n) > x £ must either be divisible by the kth power 
of some prime, or else by the square of some prime exceeding x £ ^ k . The number of integers 
ax/2 < n < ax that are divisible by the kth. power of a prime is a(l — ((k)~ 1 )x/2 + 0(x 1 / k ), 
while the number of integers n < x that are divisible by the square of a prime exceeding x s l k 
is at most 

v 4 + 1 « xl ' £/k - 

x e/k <p < x l/2 f 

Therefore, if we choose k so large that C(^) _1 > 1 — pij] X \ then the number of integers 
ax/2 <n < ax such that P*(n) < x e is at least 



( ap{£ ~ 1)x + - ( " (1 ~ c(fcrl)x + 0{x^)) - o(*w*) 



> Sx 



4ogx< 

for some constant 5 = 5(r]), as long as x is sufficiently large in terms of rj. □ 

Proof of Proposition 0: Let / = [m, M] be a closed subinterval of (0, oo), and let i be a 
real number that is sufficiently large in terms of /. Let r e / be a rational number whose 
denominator is not divisible by any prime power exceeding x log -22 x. Let 



a = e r , rj = min{e M , 1/5}, and £ = e m , 



so that 0<77<a<£<l. Since we are assuming that x is sufficiently large in terms 
of /, we can assume in particular that x is sufficiently large in terms of £ and rj when 
appealing to Lemmas [13] and [14]. Define A to be the set of all integers n in [ax, x] such that 
P*(n) < xlog~ 22 x, and set 



7T 



(xlog 22 x) and z' = tt*(x 1/5 ) 



Let {gi, q 2 , . . .} denote the sequence of prime powers in increasing order, and let pi denote 
the prime of which qi is a power. (By convention we set po = go — !)• 

Our strategy is to recursively define a sequence {aj/6j} (z + 1 > i > z') of rationals that 
increase in size as the index i decreases, such that the largest prime-power divisor of each hi is 
less than gj. The first member a z+ i/h z+ \ will be the difference between our original r and the 
sum of the reciprocals of the elements of A, and each at/bi will be obtained from the previous 
di+x/bi+i by adding several unit fractions whose denominators belong to A. The collection 
of all elements of A not involved in this construction will almost be the set 51 described in 
Proposition 0. This collection will have slightly fewer than the desired R elements, but we 
will rectify the error simply by appending the appropriate number of integers without large 
prime-power factors from the interval [ax/2, ax], and the resulting collection will be our 
set 31. 
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Define 

a-z+i 1 
= r — } — . 

We recursively define rationals {a z /b z , a z -i/b e -i, ... , a z > /b z >} and sets {0i z , . . . , 0i z /} of 
integers as follows. Suppose first that divides bf, then we apply Lemma [D| with c/d = 
a i+ i/b i+ i and q = qi- The lemma requires that = P*(&j + i), and since we are supposing 
that qi divides 6j we need only ensure that P*(6j + i) < g^. In the case i = z, the inequality 
P*(&i+i) < is equivalent to P*{b z+ i) < xlog~ 22 x, which is satisfied by the definitions of 
a z +i/b z +i and A and the hypothesis that the denominator of r is not divisible by any prime 
power exceeding xlog _22 x. On the other hand, the inequality P*(b i+ i) < will be satisfied 
for smaller values of % by the recursive construction (as we will see in a moment). Let CR^ be 



the set 11 obtained from applying Lemma 13, and let 



in lowest terms, so that by the lemma, P*(£>») < qi and thus P*(6«) < qi-\ (justifying the 
claim of the previous sentence). 

On the other hand, if q^ does not divide b i7 then we simply set 0^ = and ai/bi = a i+1 /b i+ i, 
since P*(b i+ i) < q, L by the recursive construction and does not divide b i+ i, we see that 



Notice that each Oli is a subset of A, since ^ is either empty or else (by Lemma [13|) is 
contained in [£x, x] C [ax, x], and each element n of % satisfies P*(n) = qi < x\og~ 22 x. 
Notice also that the various Oli are pairwise disjoint, again since P*(n) = qi for n G 3^. 

Now set Oi' = A \ \J z i=z > %, so that 

(here we have used the disjointness of the %). We note that the cardinality of A is 

1 

n 



\A\= £ 1 = (1 - a)x + O(l) - £ 

ax<n<i ax<n<x 
P*(n)<xlog- 22 x P*(n)>xlog- 22 x (26) 

22(1 - a)x log log x / x x 

= (1 - a x + Of- J 

log x v log X ' 

by applying Lemma |10| with y = xlog _22 x. On the other hand, 01' is a subset of A, while 
the set A \ OV is simply the union of all the %i and thus has cardinality 



u % 



z'<i<z 



Yl \%\ < 12 200(-) 2/3 log 3 x <xlog x 

z'<i<z x 1 / 5 <q<xlog- 22 x $ 



by Mertens' formula fllq ) and partial summation. Therefore the last expression in equation 
(p6|) represents the cardinality of 01' as well. 

By the hypothesis ( |I0D on R, we see that R exceeds the cardinality of 01', but by no more 
than (4aloga _1 )xloglogx/logx. Let 01" be any set of R— \0i'\ integers n from the interval 
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[ax/2, ax], each satisfying P*(n) < x 1//5 ; we can find such a set by Lemma [14] with e = 1/5, 
since 

, _ x x log logx 4xloglogx 

(4a log a ) — : < : < ox 

log x e log x 

when x is sufficiently large in terms of 5 = 5(i]) = 5(1). 

Now set ft = ft' U ft", so that ft is contained in the interval [ax/2, x] = [x/2e r , x] and the 
cardinality of ft is precisely R, and set 

a x 1 ay x 1 

— = r — y — = > — . 

6 ^-i n b z / *-± n n 

Since P*(b z >) < q' z < x 1 / 5 by the definition of z ; , and P*(n) < x 1 / 5 for all n G ft" by the 
definition of ft", we see that P*(b) < x 1//5 . It only remains to show that 1/logx < a/b < 1 
to establish the proposition. 
We have 

^ = r -J2- = log a" 1 - £ - + £ £"-£-• (27) 

From the definition of A, and by Lemma [TTj with y = a; log -22 a;, the first sum in the last 
expression of equation fl27|) is 

v- 1 1 i -i , n / 1 \ 22 log a" 1 log log x ( 1 
£ -- 2^ - = loga + Ol — J : + 0(— ■ 

P*(n)>x log -22 x 

The double sum in equation (^) is at most 

£ (200p) 2/3 lo g ^)±«-V, 
1/5 , -22 v V( r y a log x 



and the last sum in equation fl27|) is nonnegative and at most 

, . 2 (4aloga _1 )xloglogx 2 8 logo; _1 log log x 

(R - ft |) — < = . 

ax logx ax logx 

Consequently, equation (|27| ) implies the inequalities 



14 log a 1 log log x „ / 1 \ a 22 log a 1 log log x „ / 1 \ 

S + ) — T ~ S + ) 

logx v r]logx y o logx x -r)logx / 

(since a > rj), which certainly implies that 1/ logx < a/b < 1 when x is sufficiently large in 
terms of rj. This establishes Proposition [7]. □ 

5. The Small Prime Powers 

In this section we establish Proposition || We are now concerned more with having precise 
control over the number of terms in our Egyptian fractions than with sharply bounding the 
sizes of their denominators, as opposed to the case when we considered Proposition [7| in the 
previous section. While the lemmas in this section appear in Croot ||, we provide proofs 
for the sake of completeness and because we state the lemmas in somewhat different forms. 
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Lemma 15. Let p v > 5 be a power of an odd prime p, and let a be any integer. There 
exist integers mi and m-i, satisfying (p u — 3)/2 < mi < m 2 < p u and p \ mim 2; such that 
fhi + m 2 = a (mod p) . 

Proof: Assume first that p > 5. Consider the set M = {p u — (p + 3)/2 < m < p v } of 
(p + 3)/2 integers, none of which is a multiple of p. Define two sets of residues (mod p): 

Mi = {m : m G M} and M 2 = {a - m : m G M}, 

where m denotes the multiplicative inverse of m (mod p). Both Mi and M 2 have (p + 3)/2 
distinct elements (mod p), and each M, is a subset of the p residue classes (mod p), so by the 
pigeonhole principle there must be at least three residue classes m common to Mi and M 2 . 

For any such m, if we let mi = m and m 2 = a — m, then each mj is in M by the definitions 
of the Mj, and mi + m 2 = a (mod p). Furthermore, there is precisely one m (mod p), namely 
m = 2a (mod p), such that mi = m 2 (mod p) when defined this way. Therefore there is at 
least one pair (m 1) m 2 ) of distinct integers in M such that mi + m 2 = a (mod p v \ and we 
can assume that m x < m 2 by relabeling if necessary. Since p v — (p + 3)/2 > (p u — 3)/2, this 
establishes the lemma when p > 5. 

On the other hand, if p — 3 then we must have v > 2, and the lemma can be shown to 
hold by letting (mi, m 2 ) equal (3" - 2, 3* - 1), (3" - 4, 3 V - 1), or (3" - 5, 3" - 2), according 
to whether a is congruent to 0, 1, or 2 (mod 3). □ 

The following lemma is one of the two main tools used in our recursive construction in the 
proof of Proposition |8|. This lemma allows us to control all but the smallest prime powers 
that can appear in the denominators of the rational numbers to be constructed. 

Lemma 16. Let q > 4 be a prime power and let c/d be a rational number with P*(d) < q. 
There exists a set XL of integers satisfying: 

(i) IX is contained in [q 2 /5,q 2 ]; 

(ii) | IX | = 2 if ' q is odd, while \1L\ = or 1 if q is even; 

(iii) for each element n of U, P*(n) = q; 

d c 1 

(iv) if — = — — — in lowest terms, then P*(d') < q. 

a d ~t. n 

n£lL 



Proof: First assume that q is odd. We apply Lemma [15] with p u = q and 



c(d/q) (modp), if q divides d, 
(mod p), if q does not divide d 1 

finding two distinct integers m x and m 2 in the range [(q — 3)/2,q) such that fhi + m 2 = 
a (mod p). Let IX = {qm 1 ,qm 2 }. Then properties (i)-(iii) are easily seen to hold (the first 
because (q—3)/2 > g/5 for q > 5), and property (iv) holds because of the congruence (modp) 
satisfied by mi and m 2 . 

On the other hand, if q is even then q = 2 V for some v > 2. If 2 V does not divide d then 
P*(d) < q already, and we simply put IX = 0. If 2 V does divide d, then we easily check that 
the set IX = {2^(2^ - 1)} satisfies properties (i)-(iv). □ 



The following lemma is the second of the two main tools used in our recursive construction 
in the proof of Proposition |8|. This lemma allows us to control the smallest prime powers 
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that can appear in the denominators of the rational numbers to be constructed. Recall that 
L(x) = lcm{l, 2, . . . , [x\ }, as was defined during the proof of Lemma |]. 

Lemma 17. Let c/d be a rational number and define q = P*(d), where q is a power of the 
prime p. There exists an integer n satisfying: 

(i) L(q)/(p- 1) < n < e 2 «; 

(ii) P*{n) = q; 

del 

(iii) if — = — in lowest terms, then P*(d') < q. 

K ' d! d n K ' 



Proof: Define a to be the residue class of c(L(q)/q)(d/q) (mod p), where 1 < a < p — 1, 
and let n = L(q)/a. Then properties (i) and (ii) hold by the definition of L and subsequent 
remarks, while property (iii) holds by the choice of a. □ 

Proof of Proposition [§|: Define z = vr*(?/). Our strategy is very similar to the strategy 
of the proof of Proposition |7|. We recursively define a sequence {a^/^} (z + 1 > i > 1) of 
rationals that decrease in size as the index i decreases, such that the largest prime-power 
divisor of each hi is less than qi. The last member of this sequence, ao/bo, will be an integer, 
and we will show that it must be zero by bounding its absolute value. The first member 
a z+ i/b z+ i will be our original a/b, and each aijbi will be obtained from the previous aj + i/6j + i 
by subtracting two unit fractions (except in a few cases where we subtract only one or none 
at all), and the collection of the denominators of all these unit fractions will almost be the 
desired set S. This collection will have slightly fewer than the desired 2z elements, but we 
will rectify the error with a simple modification of the splitting identity (|j), and the resulting 
collection will be our set S. 

Define a 2+1 /6 2+1 = a/b and let y' = logy and z' = ir* (y'). We recursively define rationals 
{ a z/b z , ■ ■ ■ , ai/bi} and sets {§> z , . . . , §1} of integers as follows. If z' < i < z, then we apply 
Lemma |Io] with q = q% and c/d = ai + i/bi + \. The requirement of the lemma that P*(6j + i) < qi 



is satisfied for i = z by the hypothesis of the proposition and the definition of z, and it will 
be satisfied for smaller values of i by the recursive construction (as we will see in a moment). 



Let Si be the set U obtained from applying Lemma 16, and let 



h b i+1 ^ n 

in lowest terms, so that by the lemma, P*(bi) < qi and thus P*{bj) < q^\ (justifying the 
claim of the previous sentence). 

If instead 1 < i < z', then we check whether q^ divides &t+i- If n °t> then we simply set 
Sj = and a;/&; = a i+ i/b i+ i; since P*{b i+ i) < q^ by the recursive construction as before and 
since qi does not divide b i+ i, we see that P*{bi) < qi-\. On the other hand, if qi does divide 



b i+ i, then we apply Lemma |17| with c/d = a i+1 /6j +1 and q = qi. Let S« be the set {n} where 



n is the integer obtained from applying Lemma |17|, and let 

h b i+ i n 

in lowest terms, so that by the lemma, P*{bi) < qi and thus P*(bi) < qi-\. 
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Now set §' = Ui=i §i; so that 



a (Xq 1 
b bn ~i, n 

u rigs' 



We claim that in fact ao/b = 0. It is certainly an integer since its denominator 6 satisfies 
P*(bo) < Qo = 1 by construction, and it is less than 1 by the hypothesis that a/b < 1. 
Moreover, each element of each Sj is at least qf/5 if i > z', and at least L(qi)/(pi — 1) if 
% < z', since these sets resulted from applying Lemmas [16| and |17j, respectively. Therefore, 
since |Sj| < 2 if i < z' and |Sj| < 1 if i < z', 



6 & logy z ,^ z qf i<i<z' L(qi) 

i _ x ^ H _ v Pi - 1 
lo s?/ iogfe<» <? 2 ii< Z ' ' 

Since L(qi)/pi = L(gj_x), this last sum is a telescoping sum whose value is 1 — 1/L(qv), as 
is established by Croot [§, Lemma 1]; in particular, it is less than 1. Also, the penultimate 
sum is -C (logy log log y) -1 by Mertens' formula ([15]) and partial summation. Therefore 

r>r— ° d ri )-'>-' 

»o logy Mogy log logy' 

when y is sufficiently large. Therefore a /b = 0, and a/b = Z) n6 §/ 
Again by Lemmas [1^ and 17, the members of each Sj do not exceed 

e 2 ^, if l<.<^/- maX|2/ ' e 

In addition, if n is a member of Sj then P*(n) = qi, and so the Sj are pairwise disjoint; the 
cardinality of each Sj is 2, except when 1 < i < z' or when q^ is a power of 2, when the 
cardinality of Sj is or 1. Since there are -C logy of these exceptional values of i, we see 
that < 2z — |S'| <C logy. Let n be the largest element of §' and m — 2z — |S'|, and define 

§ = (§' \ {^}) U {n + m, n(n + 1), (n + l)(n + 2), . . . , (n + m — l)(n + m)}. 

Then the cardinality of S is exactly 2z = 2n*(y), and the largest element of S is 

(n + m - \){n + m) < (y 2 + O(logy)) 2 < 2y 4 

when y is sufficiently large. Moreover, since the identity 

1111 1 

+ ~, -r, ^7 + 



n n + m n(n + 1) (n + l)(n + 2) [n + m — l)(n + m) 

is valid for any positive integers m and n, we see also that J2 n ^$^/ n = XmeS 7 V n = a /b- 
Therefore S satisfies all of the properties required by Proposition M. 
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6. The Finiteness of &j(r) for j > 2 

In this section we establish Theorem |3]. For the reader's convenience we recall the definition 
of the sets £>j(r) under consideration: 

£>j(r) = {x G Z, x > r -1 : there do not exist Xi, . . . , x t G Z, x\ > ■ ■ ■ > x t > 1 

with Yh=i 1/xi = r and Xj = x}, 

so that £>j(r) is the set of numbers that cannot be the jth-largest denominator in an Egyptian 
fraction representation of r. We will make use of the following two lemmas, the first of which 
is a simple consequence of Proposition [5| stated in a more convenient form. 

Lemma 18. Let I be a closed subinterval of (0, oo). There exists a positive real number 
X(I) such that, for all real numbers x > X{T) and all rational numbers r = a/b G / for 
which P*(b) < xlog~ 23 a;, there is a set £ of positive integers not exceeding x such that 

Proof: This follows immediately from Proposition |5] if we set 

C(I)x\og \ogx- 



x 



logx 

where C(J) is a constant that is chosen so large that the right-hand side of the inequality ([?]) 
is less than x, and note that both t > T(I) and t log -22 1 > x log" 23 x will be true as long as 
X(I) is large enough. □ 

Lemma 19. There exists a positive constant k such that, for any integer k > ko, there 
exists a positive integer K = — 1 (mod k) such that P*(K) < /clog -24 k. 

Because of its length and technical nature, we defer the proof of Lemma [TP] until the end 
of this section. Assuming this lemma to be true, we may now proceed with a proof of 
Theorem || 

Proof of Theorem ^: We begin by showing that ^2( r ) is finite for any r. Let r = a/b be 
a positive rational number, and set J = [r/2, r]. Let k be any integer satisfying fclog -24 k > 
P*(b) and k > max{4/r, X(I), ko}, where X(I) and ko are the constants described in Lem- 
mas ITS] and ITS], respectively. We claim that there exists an Egyptian fraction representation 



of r whose second-largest denominator is k, and hence that k ^ <&2( r )- 

To see this, let K be a positive integer such that K = — 1 (mod k) and P*(K) < k log -24 k, 
as guaranteed by Lemma Let x = kj log A; and define the rational number r' = a'/b' by 



r T k Kk' 
We have that 1/k + 1/ Kk < 2/k < r/2, and hence r' is in /. Also, 

1 1 K +1 _ (K + l)/k 
k + Kk~ Kk ~ K ' 
where the numerator is an integer since K = — 1 (mod k), and so 

-24 r .. i -2:-S 



P*{b') < mzx{P*(b),P*(K)} < k\og~ M k < x\og 



x. 



Therefore we can invoke Lemma [18| with r' and x to produce a set £ of positive integers not 
exceeding x such that J2 n e£ V n = r ' ■> whence £ U {k, Kk} is the set of denominators for an 
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Egyptian fraction representation of r with second-largest denominator equal to k. Therefore 
k is not an element of £2(7"); and since this argument holds for all k that are sufficiently 
large in terms of r as specified above (note that the constant X(I) depends only on r), we 
have shown that £2(7") is finite. 

Now that we know that £2(7") is finite, we can establish Theorem |3| in its full strength. 
First, notice that if there exists an Egyptian fraction representation of r with the integer n 
as its jth-largest denominator (J > 2), then by splitting the term with largest denominator 
using the identity (^), we easily obtain an Egyptian fraction representation of r with n as its 
(j + l)st-largest denominator. This shows that £2(7") D £3(7") D £4(7™) !>•••• In particular, 
since £2(7") is finite, it follows that all of the £j(r) are finite for j > 3. 

Furthermore, for every element n of £2(7"), we can find an Egyptian fraction representation 
of r — 1/n using only denominators exceeding n (there are many ways to do this — one could 
use a greedy algorithm, for example) . If r — 1/n = l/rii+ • • --Hl/n^-i is such a representation, 
then the representation r = 1/ni + • • • + l/n^i + 1/n of r shows that n ^ £j(^). Since we 
can find such an integer j for each n 6 £2(7"), and since the £j(r) form a nested decreasing 
sequence of sets each contained in £2(0, we see that at some point the sets £,-(r) will be 
empty. This completes the proof of Theorem |] (modulo the proof of Lemma [H]). □ 

We remark that our proof does not show that £2(7") C £1(7"), and indeed this is false in 
general — in fact it is not even the case that £3(7-) C £i(r) always. For example, it is easy to 
see that if p is a prime, then p + 1 cannot be the third-largest denominator in an Egyptian 
fraction representation of r = 1/p + l/(p + 1), and so p + 1 is an element of £3(7") (hence 
of £2(7") as well) but not £i(r). (On the other hand it seems likely that £2(1) C £i(l), for 
instance, although this doesn't seem trivial to show.) We can always convert an Egyptian 
fraction representation of r whose largest denominator is some integer n into one whose 
fourth-largest denominator is n, by repeatedly splitting the term with largest denominator 
other than n and examining the various ways in which the term 1/n could be duplicated 
under this process. In this way one can show that £4(7") C £1(7*) for every r. It might be 
interesting to try to classify the rational numbers r for which £2(7") (jL £i(t"). 

We now return to the task of establishing Lemma One possibility would be to cite an 



existing result on smooth numbers in arithmetic progressions in which the modulus of the 
progression was allowed to exceed the smoothness parameter, such as a theorem of Balog 
and Pomerance QXJ] , and remove those numbers divisible by large prime powers in an ad hoc 
manner. We prefer to provide a self-contained proof of Lemma one that nevertheless has 
ideas in common with the method of Balog and Pomerance, including a reliance on estimates 
for incomplete Kloosterman sums. 

The following lemma is still much stronger than we need but nearly the least we could 



prove to establish Lemma [19]. The author would like to thank Henryk Iwaniec for a helpful 
conversation concerning the proof of this lemma. 

Lemma 20. Let k be a positive integer and e and x positive real numbers such that /c 6 / 7+e < 
x < k. The number of ordered pairs (m, n) of coprime positive integers less than x such that 
mn = — 1 (mod k) is 



6x 2 yr / p 

w 2 k ^p 



nM^+o^n (28) 



In particular, such ordered pairs (m,n) exist when k is sufficiently large in terms of e. 
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Of course, Lemma pi] follows by setting e = 1/14, say, choosing x such that k 13 ^ 14 < x < 
k log -24 k, and letting K = mn where (m, n) is one of the pairs whose existence is ensured 
by Lemma when k is sufficiently large. The restriction that m and n be coprime causes a 
little more trouble than one might think; without it, one could simplify the proof and easily 
obtain an error term of xk~ l l A+£ in the asymptotic formula whence x could be taken 
as small as fc 3 / 4+e in the hypotheses of the lemma. In any case, even a hypothesis as weak 
as x > k 1 ' 5 would be ample for our needs, so we have not gone to great lengths to make the 
error terms as small as possible in the proof. 



Proof of Lemma BO: All of the constants implicit in the -C and O-notation in this proof 



may depend on e. We use d(n) to denote the number of divisors of n and note that d{n) <C n e 
for any positive e. We also recall that ||a;|| denotes the distance from x to the nearest integer 
and that ek(x) = e 2mx ^ k . 

Let N denote the number of ordered pairs (m, n) of coprime positive integers less than x 
such that mn = — 1 (mod k), so that 

n= EE i= EE E /*(<*) = EM<0 E E i 

m<xn<x m<x n<x d\(m,n) d<x m<x/dn<x/d 

(m,n)=l mn=-\ (mod k) mnd 2 =-l (mod k) 

mn=— 1 (mod k) 

by changing m and n to md and nd, respectively. Let y and z be real numbers to be specified 
later subject to 1 < z < y < x, and write N = Ni + 0(N 2 + N 3 ) where 

tfi = £Md) EE 1 ; 

d<z m<x/dn<x/d 

mnd?=—l (mod k) 

^= E EE 1 ; 

z<d<y m<x/dn<x/d 

mnd 2 =—l (mod k) 

^3= E E E i< E E E i- 

y<d<x m<x/dn<x/d m<x/yn<x/y y<d<x 

mnd? = -l (mod fc) mnd 2 = -l (mod fc) 

We begin by bounding N 2 and AT 3 . The estimation of iV2 is trivial: writing I = mn we 
have 

n, < e E rf « « (* 2 > e/2 E (i + 1) « (5 + »)*■■ 

ld 2 = -l (mod fc) 

As for A^, we use Cauchy's inequality to write 

Ni< x ^ E E ( E iV = $ E E E E i- 

" m<x/y n<x/y V y<d<x / " m<x/yn<x/y y<di<xy<d2<x 

mnd 2 =-l (mod fc) mnd 2 =-l (mod fc) 

mnd|=— 1 (mod fc) 

The congruence conditions imply that m and n must be coprime to k and thus that d\ = 
c?2 (mod fc). Therefore we can weaken the conditions on the variables d\ and d 2 and make 
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them independent of m and n, yielding the upper bound 

N i<^ E E E E i<£ E #{i<di,<fe<*:d?-<£ = i} 

" m<x/yn<x/y y<d 1 <xy<d 2 <x " |i|<x 2 (29) 

d^=d^ (mod fc) fe|i 

by writing / = d\ — d\. The term Z = contributes , while the remaining terms contribute 
at most d(|/|) each, since d\ and <i 2 are determined by d\ + c?2 and d\ — d 2 which must be 
complementary divisors of I. Because \l\ < x 2 < k 2 , we see that d(\l\) <C k £ . Since the 
number of terms with |/| ^ in the final sum in equation (p9|) is <C x 2 /k < x, the sum is 
bounded by 0(xk e ), and so equation (^) gives us 

< 



N 3 « (-**') 

We have thus shown that 



x ' , A i/2 ^ x b l 2 k e 

t ' y 2 



2 5/2 

^ = ^i + 0((^ + y+^)4 (30) 

It remains to evaluate N±. Using the additive characters Ck{x) to detect the condition 
mnd 2 = — 1 (mod k), or equivalently (dm, A;) = 1 and m + nc/ 2 = (mod k), we have 

Ni= E M<0 E E ^ E e fc (Mm + nrf 2 )) 

d<2: m<x/d n<x/d h (mod fc) 

(d,fc)=l (m,fc)=l 



£ E V(d) Y ( E e fc (/im)jf Y e k (hnd 2 

d<z h (mod k) \ m<x/d / \n<x/d 

(d,k)=l (m,k)=l 



(31) 



The terms with h = (mod k) contribute 

1 

k 



I E Kd)( E i)(Ei) = | E Mrf)(^ + o(F/ 2 ))Q + o ( i) 

^ d<z \ m<x/d / \n<x/d I d<z Uft/ U 



/d / \ n<x/d 
(d,k)=l (m,fc)=l (d,fc)=l 



X 2 0(fc) M<0 (n/ X \ - 1" 

fc 2 ^ d 2 U W2 ^ d 

d<z \ d<z / 

(d,k)=l (d,k)=l 

2 m } 
k 2 



Both error terms are 0(x 2 /zk 1 £ ), and so equation ( [3~T1) becomes 

(d,fc)=l 

where we have defined 

T{d)= Y, [ E e k (hfn))( Y e k (hnd 2 )). 

h^O (mod k) \ m<x/d I \n<x/d ) 

(m,k)=l 



24 



GREG MARTIN 



To estimate the T(d), we make use of the elementary bound 

hd 2 



k 



e k (hnd 2 ) < 

n<x/d 

when k does not divide hd 2 , and also the Weil bound for incomplete Kloosterman sums (see 
for instance Hooley |7|, Lemma 4 (Section 2.5)]), which gives as a special case 

£ e k {hm)^ e k^l 2 {Kkfl 2 . 



m<x/d 
(m,fc)=l 



It follows that 



hd 2 



k 



(33) 



T(d) « fc 1 /2+ £ /2 £ ^ >A .)l/2 
ft^O (mod fc) 

when ti is coprime to k. Because (d, fc) = 1 we may reindex this sum by replacing hd 2 with 
ft, which doesn't affect the quantity (ft, A;). When 1 < ft < fc/2, we note that ||/i/A;|| _1 = 
— /i)/A;|| _1 = k/h and (ft, k) = (k — ft, k), and so 

/id 2 -i ^ (h,ky/ 2 



E (M) 1/2 

/i^O (mod fc) 



2k £ 

l<fc<fc/2 



ft 



By writing ft = ft,'/ where / = (ft, A;), this last sum is easily seen to be <C c?(/c) log A;, and 
T(d) <C fc 3 / 2+£ follows from this bound and the estimate (|33|). 

Using this estimate for the T(d) in equation ([32]), the asymptotic formula (|30|) for iV 
becomes 



AT 



x 2 (j)(k) 

WW 



no 

pi* 



l \-i 



5/2 s 



+ o((^ + ^' 2 + y + ^K) 



(34) 



We optimize this error term by choosing z = xk 3//4 and y = x 5//6 , in which case the error 
term is O^xk -1 ^ + x 5//6 )fc e ), and the x 5 ^ 6 k £ term dominates since x < k. We also note that 
C(2) = tt 2 /6 and that 

* (fc) -n(i-£r=n(; 



A- 



p fc 



pifc 



p 
+ 1 



and so the main term in (|34j) is the same as the main term in 
lemma. 



This establishes the 



7. The Order of Magnitude of Li(r;x) 

In this section we establish Theorem f|. We recall that Li(r; x) is the number of integers in 
£i(r) not exceeding x, that is, the number of integers not exceeding x that cannot be the 
largest denominator in an Egyptian fraction representation of r. 

Proof of Theorem ^: First we establish the lower bound in the inequality @. Let r 
be a positive rational number and x > 1 a real number, set y = Cx/logx with C > 1 a 
large constant, and suppose that x is so large that all prime divisors of the denominator 
of r are less than y. By Lemma |9|, if n < x is the largest denominator in an Egyptian 
fraction representation of r, it must be true that n is not divisible by any prime larger than 
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y (when C is chosen large enough). In other words, the set £i(r) contains all integers n < x 
such that P(n) > y. The number of integers n < x such that P(n) > y is asymptotic to 
x log log xj logx by Lemma 110, which establishes the required lower bound. 

To establish the corresponding upper bound, we use a method similar to the one used in 
the proof of Theorem |3] to argue that &2( r ) is finite. Let r = a/b be a positive rational 
number, and set / = [r/2,r]. Let x be a positive real number and set x' = x/logx and 
y = x'log _23 x', so that y > xlog~ 24 x. We suppose that x is so large that y > P*(b) and 



x' > max{2/r, X(I)}, where X(I) is the constant described in Lemma 18. Let k be an 
integer such that x' < k < x and P*(k) < y, and define the rational number r' = a'/b' by 
r = r — 1/k. 

Since 1/k < 1/x' < r/2 we see that r' e P, moreover, P*(b') < max{P*(6), P*(k)} < y. 
Therefore we may invoke Lemma ITS] with r' and x' to produce a set £ of positive integers 



not exceeding x' such that J2n€£ l/ n = r ' ■> whence £ U {k} is the set of denominators for an 
Egyptian fraction representation of r with largest denominator k. 

From this argument we deduce that for x sufficiently large, the elements of £j\{r) not 
exceeding x are all contained in {r _1 < n < x'} U {r _1 < n < x: P*(n) > xlog _24 x}, whose 
cardinality is 

logx xloglogx 

•C X + X log < : 

log y log x 



by Lemma ID. This upper bound holds when x is large enough in terms of r as described 
above, but it will hold for all x by adjusting the implicit constant (depending on r) if 
necessary. This establishes the upper bound in the inequality (|B|) and hence Theorem ^. □ 

We remark that we have actually established the inequalities 

(1 + o r (l))a;logloga; ^ T . . (24 + o r (l))a;logloga; 

< LAr; x) < 



log x log x 

We did so by showing essentially that an integer n is in ^i(r) if and only if n can be written 
as n = p u m with m less than a power of logp, though we were unable to pinpoint this power 
other than to show that it lies between l + o(l) and 24 + o(l). With much more care we could 
improve the constant 24 to 3 but no further at present. Nevertheless we speculate that the 
correct power is 1, i.e., that for any fixed r and e > there are only finitely many integers 
n E &i(r) such that, if P*(n) = p u and n = p u m, then m > \og 1+£ p. This would imply 
that the counting function Li(r; x) of those integers that cannot be the largest denominator 
in an Egyptian fraction representation of r is asymptotic to x log log xj log x with leading 
coefficient 1. 
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